AIbase

# Expanded Vocabulary

Ruri V3 30m
Apache-2.0
Ruri v3 is a general-purpose Japanese text embedding model built on ModernBERT-Ja. It handles sequences of up to 8,192 tokens and delivers top-tier performance on Japanese text embedding tasks.
Text Embedding, Japanese
cl-nagoya
1,135
3
Mistral 7B V0.3
Apache-2.0
Mistral-7B-v0.3 is an updated large language model based on Mistral-7B-v0.2. Its main change is a vocabulary extended to 32,768 tokens.
Large Language Model, Transformers
mistralai
442.55k
472
Llama 2 Ko 7b
Llama 2 Ko is a Korean-adapted version of Llama 2, optimized for Korean text generation through vocabulary expansion and additional pre-training on a Korean corpus.
Large Language Model, Transformers, Supports Multiple Languages
beomi
3,451
175